Accession Number |
TCMCG048C30246 |
gbkey |
CDS |
Protein Id |
XP_031251164.1 |
Location |
join(2043129..2043247,2043626..2043770,2044170..2044267,2044872..2044947,2045203..2045322,2046066..2046113,2047511..2047575,2048729..2048814,2049339..2049503,2050295..2050404,2050705..2050810,2050906..2050981,2051615..2051673,2051785..2051852,2052531..2052704,2053813..2054019,2054974..2055230,2055304..2055447,2056852..2056909,2057177..2057476,2057693..2058424,2058739..2058966) |
Gene |
LOC116109048 |
GeneID |
116109048 |
Organism |
Pistacia vera |
|
|
Length |
1146aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA578116 |
db_source |
XM_031395304.1 |
Definition |
DNA mismatch repair protein MSH1, mitochondrial isoform X1 [Pistacia vera] |
CDS: ATGTACTGGCTAGCTACCAGAAACGCTGCCGTTACATTCCCAAAGTTGCGCTCACTTTCTCTGTTCCTTCGTTCCCCTGTTCTCAGTCACACTCCATTTCGCCCCTCCTCACTGTTTCTCAGCCGCAGACAGTTTGGTCAGATTTATTTTTTCAAAGACCGGAAGGCTTCGAGAGGGATTACGAAAGCTTCGAAGAAAGTTAAAGTATCAAATGATAATGTTCTAAATGACAAGGATCTTTCCCACTTAATGTGGTGGCAAGAGAGGCTACAAACTTGCCGGAAACCTTCTACTCTTCATCTGGTTAAAAGGCTCAAATATTCCAATTTGCTCGGCTTGGATGTTAGCTTGAAAAATGGGAGTCTAAAAGAAGGAACACTCAATTGGGAGATGTTGCAGTTCAAGTCTAAGTTTCCTCGTGAAGTTCTGCTCTGCAGAGTTGGAGATTTTTATGAAGCCATTGGAATAGATGGTTGTATTCTTGTTGAGTTTGCTGGTTTGAATCCATTTGGTGGTTTGCGTCCGGAGAGCATACCAAAAGCCGGTTGCCCTGTCGTGAATCTACGACAAACTTTGGATGATTTGACACGTAATGGCTATTCAGTTTGCATAGTGGAGGAAGTTCAAGGTCCAACACAAGCTCGTTCTCGTAAAAGCCGTTTTATATCTGGGCATGCACATCCAGGCAGTCCTTATGTATTTGGACTTGTTGGAATTGATCACGATCTTGACTTTCCAGAGCCAATGCCTGTTGTTGGAATATCTCGTTCGGCAAGGGGGTATTGCATAATTTTAGTCTTAGAAACTATGAAGACATATTGTTCAGAGGATGGTCTAACTGAAGATGCTTTAGTTACCAAGCTACGTACTTGTCGATACCATCACCTATTTCTGCATACATCCTTGAGACAGAATACATCAGGAACTTGCCGTTGGGGAGAATATGGTGAGGGAGGCCTACTGTGGGGAGAATGTAATTCCAGACTTTTTCAATGGTTTGAAGGCAATCCAGTCACTGACCTTTTGTTTAAGGTGAAGGAAGTTTATGGTCTTGAAAATGAAGTTACATTCAGAAATGTTACTGTGCCTTATGAAAATAGGCCCCGCCCTTTATATCTAGGAACAGCCACACAAATTGGTGCCATACCAACTGAGGGAATACCATCTTTGTTAAAGGTGTTGCTTCCATCAAACTGCACTGGACTACCTGTATTGTATGTCAGAGATCTTCTCCTCAATCCTCCTGCTTATGAGATTGCATCCACAATTCAAGCAATAGGCAAACTTATGAGCAACGTGACATGTTCAATTCCTGAGTTTACATGTGTTTCACCTGCCAAGCTTGTGAAGCTACTTGAATTGAGGGAGGCCAATCATATTGAGTTTTGTAGAATAAAAAATGTACTTGATGAAACCTTGCACATGTATGGAAACTCTGAGCTTAATGAAATCCTGAAATTGTTGATGGATCCAACCTGGGTTTCAACAGGGTTGAAAATTGACTTTGAGACATTTGTTAAGGAATGCGAATGTGCTTCAGTCAGAATTGGTGAAATGATCTCTCTTGATGGTGAAAGTGATCAAAAGATAAGTTCCTATAATGGCATTCCGAGTGATTTTTTTGAGGATATGGAATGTCTGTGGAAAAGGCGTGTGAAGAGGACCCACATTGAAGAAGAAATTGCAGAAGTCGAAAAGGCTGCTGAGGCCTTGTCATTAGCAGTTACTGAAGATTTTCTTCCTATTTTCTCAAGAATAAAAGCTACTACAGCCCCACTTGGTGGTCCAAGGGGGGAAATTTTATACGCTAGAGAGCATGAAGCTGTATGGTTTAAGGGAAGGCGATTTAGACCAGCAGTATGGGCTGGTACCCCTGGGGAAGAACAAATTAAACAGCTTAAGCCCGCTATAGATTCTAAAGGTAGAAAAGTTGGAGAAGAATGGTTTAGCACAATGAAGGTTGAAGATGCTTTATTGAGGTACCATGAGGCAGGT
GCCAAGGCAAAAGCAAAGGTCTTGGAATTGTTGAGAGGACTTTCTTCAGAGTTGCAAACTAAAATTAATATCCTTGTGTTTGCTTCAATGCTTCTTGTTATTGCAAAGGCATTATTTGCTCATGTGAGTGAAGGTAGGAGAAGGAAATGGGTTTTCCCTACACTTGTTGGGTTCAACAGTTCTGAGACTATAAAACCACTCAACGGAGCAAATAGCTTGAAGATGATTGGTTTATCACCATATTGGTTTGATGTAGCTGAAGGCAGTGCTATTCATAATACAGTTGACATGCAGTCATTGTTTCTCTTGACGGGTCCAAATGGGGGTGGTAAATCTAGTTTGCTGCGATCCATTTGTGCTGCTGCATTACTTGGAATATGTGGATTTATGGTGCCTGCAGAGTCAGCCTTAATTCCTTACTTAGATGCTATTATGCTCCACATGAAATCTTATGATAGCCCTGCTGATGGAAAAAGTTCATTTCAGGTAGAAATGTCTGAGGTTCGGTCTATTATTACTGGAGCAACTTCCAGAAGTCTTGTGCTCATAGATGAAATTTGTCGAGGAACAGAAACAGCGAAAGGAACCTGTATTGCTGGTAGTATTATTGAGACTCTTGATAAAATTGGTTGTCTAGGAATTGTGTCCACTCACTTGCATGGAATCTTTGATTTACCGCTTAATACCAAGAACACCATGTACAAAGCAATGGGAACAGAATATGTTGATGGAAAGACAAAACCAACTTGGAAGTTAATAGATGGGATCTGTAGAGAAAGCCTTGCATTTGAAACAGCTAAAAAGGAAGGAGTTCCTGAGACAATAATTCAAAGAGCTGAAGACCTTTATCTGTCGGTTTATGCAAAAGAGAATTCTTCAGAAAGAAGTGACAGGAAAGGATCGCAAGTGTGTTCTGAAACAAGGATTGATTGTTCTGTTGAAGCTGATCTCCATTTTAATAACATTGGTGTGGGATCTGTCCATCATAAGATTGAGTCAATGATGACAATGGAAGTCTTACATAAGGAAATCAATAGAGCTGTCACTGTAATTTGTCAGAAGAAGTTGATTGAGCTAAATAAGCAGAAAAACACATCTGAAATTGCTGGGGTAAAGTGTGTCTCCATTGCTGCTAAGGAGCAGCCACCTCCATCGGTTATAGGTGCTTCATGCGTCTATGTGATGCTGAGACCTGACAAGAAACTATATGTTGGACAGACTGATGATCTCGACGGCCGAATCCGTTCTCATCGATCAAAAGAAGGAATGCAGACTGCCTCTTTCCTTTATTTCATAGTCCCGGGGAAGAGCGTAGCATGCCAAATTGAAACTCTTCTCATCAACCAGCTATATAATCAAGGCTTCCCTCTGGCCAACATTGCTGATGGCAAGCATCGGAATTTTGGCACATCAAATCAATTTGCAGAAACTTTGACTGTTCATTAA |
Protein: MYWLATRNAAVTFPKLRSLSLFLRSPVLSHTPFRPSSLFLSRRQFGQIYFFKDRKASRGITKASKKVKVSNDNVLNDKDLSHLMWWQERLQTCRKPSTLHLVKRLKYSNLLGLDVSLKNGSLKEGTLNWEMLQFKSKFPREVLLCRVGDFYEAIGIDGCILVEFAGLNPFGGLRPESIPKAGCPVVNLRQTLDDLTRNGYSVCIVEEVQGPTQARSRKSRFISGHAHPGSPYVFGLVGIDHDLDFPEPMPVVGISRSARGYCIILVLETMKTYCSEDGLTEDALVTKLRTCRYHHLFLHTSLRQNTSGTCRWGEYGEGGLLWGECNSRLFQWFEGNPVTDLLFKVKEVYGLENEVTFRNVTVPYENRPRPLYLGTATQIGAIPTEGIPSLLKVLLPSNCTGLPVLYVRDLLLNPPAYEIASTIQAIGKLMSNVTCSIPEFTCVSPAKLVKLLELREANHIEFCRIKNVLDETLHMYGNSELNEILKLLMDPTWVSTGLKIDFETFVKECECASVRIGEMISLDGESDQKISSYNGIPSDFFEDMECLWKRRVKRTHIEEEIAEVEKAAEALSLAVTEDFLPIFSRIKATTAPLGGPRGEILYAREHEAVWFKGRRFRPAVWAGTPGEEQIKQLKPAIDSKGRKVGEEWFSTMKVEDALLRYHEAGAKAKAKVLELLRGLSSELQTKINILVFASMLLVIAKALFAHVSEGRRRKWVFPTLVGFNSSETIKPLNGANSLKMIGLSPYWFDVAEGSAIHNTVDMQSLFLLTGPNGGGKSSLLRSICAAALLGICGFMVPAESALIPYLDAIMLHMKSYDSPADGKSSFQVEMSEVRSIITGATSRSLVLIDEICRGTETAKGTCIAGSIIETLDKIGCLGIVSTHLHGIFDLPLNTKNTMYKAMGTEYVDGKTKPTWKLIDGICRESLAFETAKKEGVPETIIQRAEDLYLSVYAKENSSERSDRKGSQVCSETRIDCSVEADLHFNNIGVGSVHHKIESMMTMEVLHKEINRAVTVICQKKLIELNKQKNTSEIAGVKCVSIAAKEQPPPSVIGASCVYVMLRPDKKLYVGQTDDLDGRIRSHRSKEGMQTASFLYFIVPGKSVACQIETLLINQLYNQGFPLANIADGKHRNFGTSNQFAETLTVH |
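
The Location field above uses the feature-table join(...) notation: each start..end pair is a 1-based, inclusive coding segment, and the segments concatenated in order give the CDS. As a consistency check, the summed segment lengths should equal three times the protein length plus one stop codon. The following is a minimal sketch of that check, not part of the record itself; the helper names parse_join and cds_length are hypothetical, and it assumes a plain forward-strand join with no complement(...) wrapper or partial-end markers (< or >).

```python
import re

# Location string copied verbatim from the record's "Location" field.
LOCATION = (
    "join(2043129..2043247,2043626..2043770,2044170..2044267,"
    "2044872..2044947,2045203..2045322,2046066..2046113,"
    "2047511..2047575,2048729..2048814,2049339..2049503,"
    "2050295..2050404,2050705..2050810,2050906..2050981,"
    "2051615..2051673,2051785..2051852,2052531..2052704,"
    "2053813..2054019,2054974..2055230,2055304..2055447,"
    "2056852..2056909,2057177..2057476,2057693..2058424,"
    "2058739..2058966)"
)

PROTEIN_LENGTH_AA = 1146  # from the record's "Length" field


def parse_join(location):
    """Return (start, end) pairs from a simple join(...) location string.

    Hypothetical helper: handles only plain start..end segments.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(segments):
    """Total nucleotides covered; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total_nt = cds_length(segments)

# Expected coding length: one codon per residue plus one stop codon.
expected_nt = 3 * (PROTEIN_LENGTH_AA + 1)

print(len(segments), total_nt, total_nt == expected_nt)
```

For this record the check reports 22 segments covering 3441 nt, which matches 1146 codons plus the stop codon (the CDS ends in TAA).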